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Abstract 


The  results  of  a  simulation  study  intended  to  determine  the  circum¬ 
stances  under  which  it  is  beneficial  to  operate  a  data  base  management 
system  with  a  multi-processor  back-end  are  presented.  The  basic  concepts 
of  back-end  data  base  management  systems  and  multi-processor  back-end 
systems  are  provided  as  background  material.  The  general  structure  of 
the  simulation  model  which  has  been  implemented  in  GPSS  is  outlined.  The 
results  of  the  study  indicate  that  the  amount  of  CPU  activity  required  by 
the  data  base  management  system  is  a  determining  factor  with  respect  to 
the  need  for  a  multi-processor  back-end. 


* 
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1.  Introduction 


As  the  utilization  of  data  base  management  systems  (DBMS)  increases, 
a  proportionate  rise  in  the  demand  for  system  resources  is  observed.  Often 
this  demand  exceeds  the  capacity  of  the  central  computer  of  a  data  processing 
installation.  One  practical  and  economic  method  of  increasing  system 
capacity  without  an  expensive  upgrade  is  to  attach  a  minicomputer  to  the 
central  computer  to  share  the  data  base  load.  The  resultant  configuration 
is  known  as  a  back-end  DBMS. 

Typically,  a  back-end  DBMS  consists  of  one  processor  to  execute  applica¬ 
tion  programs  and  one  to  perform  the  data  base  functions.  It  is  possible 
to  distribute  the  data  base  functions  over  several  processors  in  a  multi¬ 
processor  back-end  DBMS.  A  simulation  study  has  been  conducted  to  determine 
the  circumstances  under  which  a  multi-processor  back-end  DBMS  provides 
performance  superior  to  a  single  processor  back-end  DBMS. 

In  order  to  provide  the  proper  background  for  the  interpretation  of  the 
simulation  results,  the  essential  concepts  and  basic  structure  of  back-end 
data  base  systems  are  presented. 

2 .  Back-End  Data  Base  Management  Systems 

The  principal  idea  of  a  back-end  data  base  management  system  is  that 
all  data  base  operations  are  performed  by  a  computer  that  is  distinct  from 
the  processor  executing  the  application  programs  requesting  the  operations. 
The  concept  of  a  back-end  DBMS  was  initially  proposed  by  Canaday,  et  al 
[1]  and  has  been  studied  by  several  other  researchers  [2-10].  The  processor 
on  which  the  application  programs  reside  is  known  as  the  host  processor. 

The  computer  controlling  the  access  to  the  data  base  is  a  back-end  machine. 
Figure  1  illustrates  the  distribution  of  software  in  a  back-end  DBMS. 
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As  shown  in  Figure  1,  the  basic  philosophy  of  a  back-end  DBMS  is  to 
minimize  the  resources  of  the  host  processor  required  by  the  data  base 
system.  Other  than  the  application  programs,  the  only  data  base  system 
requirements  are  an  interface  routine,  buffer  space,  and  a  message  system. 

The  message  system  is  a  communications  package  that  controls  the  exchange 
of  information  between  the  host  and  back-end  machines.  The  DBMS  interface 
routine  is  called  whenever  a  data  base  operation  is  requested  by  an  applica¬ 
tion  program.  The  interface  formats  the  request  into  a  message  and  then 
calls  the  message  system  to  transmit  the  request  to  the  data  base  system 
software  on  the  back-end  computer. 

The  back-end  machine  contains  all  of  the  functional  DBMS  software. 

The  actual  execution  of  the  data  base  operation  is  carried  out  by  one  of 
several  DBMS  tasks.  It  has  been  demonstrated  [11]  that  in  order  for  a 
back-end  DBMS  to  provide  any  performance  benefits,  both  the  host  and  back¬ 
end  processors  must  be  multi-programmed.  The  DBMS  control  software  on  the 
back-end  initiates  and  manages  the  DBMS  tasks.  The  back-end  processor  also 
holds  the  logical  description  of  the  entire  data  base,  the  schema,  and  the 
logical  description  of  the  portion  of  the  data  base  available  to  a  particular 
application  program,  the  sub-schema.  All  data  base  operations  are  validated 
against  the  sub-schema  and  schema  by  the  DBMS  control  software  before  being 
passed  to  a  DBMS  task  for  execution.  Figure  2  portrays  the  communication 
sequence  occurring  when  an  application  program  issues  a  data  base  request. 

The  feasibility  of  a  back-end  DBMS  in  a  data  processing  environment  has 
been  demonstrated  in  a  prior  study  [3J.  The  results  of  that  study  indicate 
that  a  back-end  DBMS  provides  the  following  advantages  over  a  single  machine 
system: 

1.  The  host  memory  allocated  to  the  DBMS  is  significantly  reduced. 


2.  The  amount  of  memory  per  application  required  in  the  host  machine 
is  reduced. 

3.  Operating  system  overhead  is  decreased  on  the  host  system  since 
many  operating  system  functions  are  assumed  by  the  back-end  machine. 

4.  By  transferring  tasks  to  the  back-end  minicomputer,  the  advantages 
of  minicomputers  are  realized — i.e.,  they  are  less  expensive  per  instruction 
to  operate  than  large  mainframes. 

5.  Integrity  is  improved,  since  the  computers  can  oversee  one  another. 

6.  Data  base  protection  is  enhanced,  since  control  of  the  data  base  is 
independent  of  the  host  computer  and  its  memory  space. 

7.  The  system  has  a  modular  organization  which  provides  direct  and 
indirect  benefits  from  the  standpoints  of  maintenance  and  software  develop¬ 
ment. 

This  study  extends  the  notion  of  a  back-end  DBMS  to  allow  several 
machines  to  perform  the  back-end  function.  This  configuration  has  been 
proposed  by  Lowenthal  [2]  and  is  known  as  a  multi-processor  back-end  DBMS. 
Figure  3  depicts  the  general  structure  of  a  multi-processor  back-end  DBMS. 

The  software  structure  of  a  back-end  processor  in  a  multi-processor 
back-end  DBMS  is  identical  to  the  structure  in  a  single  back-end  configura¬ 
tion.  The  host  processor  must  contain  sufficient  software  to  determine  the 
back-end  processor  to  execute  a  particular  data  base  operation.  As  indicated 
in  Reference  [12],  the  back-end  processor  can  be  determined  from  the  identi¬ 
fiers  of  the  data  being  accessed. 

3 .  The  Simulation  Model 

The  model  used  to  describe  a  multi-processor  back-end  DBMS  is  an  extension 
of  the  model  developed  by  Maryanski  and  Wallentine  [11]  for  a  single  processor 
back-end  DBMS.  Implementation  was  done  using  CPSS.  A  very  high-level  descrip- 
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Cion  of  the  simulation  of  a  data  base  operation  is  presented  in  Figure  4. 

The  model  can  be  parameterized  to  describe  any  distributed  data  base 
management  system  with  arbitrary  numbers  of  host  and  back-end  processors. 

In  order  to  describe  a  configuration,  the  user  must  provide  information  on 
the  intermachine  linkages,  relative  processing  capabilities,  number  of 
disks  associated  with  each  processor,  and  the  number  of  data  base  partitions 
to  be  supported  by  each  machine.  For  the  simulation  experiment  described 
here,  the  hardware  configuration  portrayed  in  Figure  5  was  modeled.  This 
configuration  is  composed  of  computers  in  the  laboratory  of  the  Computer 
Science  Department  at  Kansas  State  University. 

The  job  mix  can  be  characterized  by  the  parameter  indicating  the  amount 
of  computation  by  the  host  CPU  between  data  base  requests.  For  this  experi¬ 
ment,  the  average  values  were  set  at  1,  13,  and  130  milliseconds  to  indicate 
light,  moderate,  and  heavy  computational  activity  (or  conversely,  heavy, 
moderate,  and  light  data  base  activity).  Parameters  indicating  the  number 
of  I/O  operations  per  data  base  request  and  the  amount  of  computation  by  the 
back-end  processor  between  I/O  operations  can  be  used  to  further  character¬ 
ize  the  data  base  environment.  For  the  experiment  reported  here,  these 
parameters  were  not  varied.  The  values  of  the  parameters  were  set  to 
describe  an  environment  in  which  data  base  requests  required  very  little 
computation  in  the  back-end  and  the  majority  (56%)  of  which  did  not  require 
a  physical  I/O.  A  data  base  system  with  batch  updating  and  simple  inquiries 
fits  this  characterization. 

4 .  Results 

The  object  of  the  simulation  was  to  determine,  for  the  particular  environ¬ 
ment,  the  performance  effects  of  adding  additional  back-end  processors  to  a 
back-end  DBMS.  Performance  is  measured  in  terms  of  the  number  of  data  base 
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operations  completed  per  minute.  Of  the  back-end  processors  shown  in 
Figure  5,  Back-Ends  1  and  3  are  assumed  to  be  of  approximately  equal 
computational  power,  while  Back-End  2  is  assumed  to  be  significantly  less 
powerful.  Computational  power  was  roughly  measured  by  relative  instruction 
speed . 

The  most  important  parameter  with  respect  to  performance  is  the  amount 
of  computation  between  host  data  base  requests.  In  all  situations,  with 
various  numbers  of  host  and  back-end  partitions,  when  the  mean  inter-data 
base  requests  time  was  set  at  1  or  13  milliseconds,  a  significant  performance 
increase  was  noted  with  any  of  the  multi-processor  back-end  configurations. 
However,  with  a  mean  inter-data  base  request  time  of  130  milliseconds, 
the  performance  benefits  of  the  multi-processor  back-end  configurations  were 
insubstantial.  Figures  6-8  provide  a  sampling  of  the  simulation  results. 

5.  Analysis 

The  simulation  results  indicate  that  for  moderate  to  heavy  data  base 
I/O  activity,  a  multi-processor  back-end  DBMS  has  the  potential  for  increas¬ 
ing  performance  in  a  batch  update,  simple  inquiry  environment.  A  "typical" 
commercial  data  processing  facility  has  an  environment  similar  to  that  used 
in  the  model.  The  feasibility  of  a  single-processor  back-end  DBMS  has  been 
evaluated  favorably  in  previous  work  [3].  The  results  presented  here  indi¬ 
cate  that  if  increased  performance  is  desired  from  a  single  processor  back¬ 
end  DBMS  in  a  data  processing  installation,  the  addition  of  another  back-end 
processor  is  a  reasonable  strategy  if  the  data  base  demand  is  moderate  to 
heavy. 

Observation  of  Figures  6-8  indicates  no  clear  performance  superiority 


between  configurations  1,  2,  and  1,  3.  Other  results  not  shown  in  the 
figures  indicate  that  in  a  single  processor  back-end  DBMS,  Back-End  1  and 
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Back-End  3  give  approximately  equal  performance.  These  findings  lead  to 
the  conclusion  that  in  the  environment  simulated,  computational  speed  of 
the  back-end  machine  is  not  an  important  performance  factor.  However, 
preliminary  results  in  a  follow-on  simulation  experiment  indicate  that 
if  the  computation  time  in  the  back-end  per  data  base  operation  is  increased, 
the  computational  speed  of  the  back-end  machine  can  become  a  determinant 
of  system  performance.  In  terms  of  the  data  base  environment,  an  increase 
in  back-end  computation  per  data  base  operation  corresponds  to  an  increase 
in  complexity  of  the  data  base  operations. 

6.  Conclusion 

The  simulation  results  indicate  that  for  many  data  processing  installa¬ 
tions  the  progressive  addition  of  back-end  data  base  machines  is  a  viable 
long-term  plan  for  efficient  and  economical  data  base  management.  Further 
investigations  are  required  to  determine  necessary  capabilities  of  back-end 
data  base  computers  in  varying  data  base  environments. 
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